Fuzzy Annotation of Web Data Tables Driven by a Domain Ontology

Authors

  • Gaëlle Hignette
  • Patrice Buche
  • Juliette Dibie-Barthélemy
  • Ollivier Haemmerlé
Abstract

We propose an automatic system for accurately annotating data tables extracted from the web. This system is designed to provide additional data to an existing querying system called MIEL, which relies on a common vocabulary used to query local relational databases. We use the same vocabulary, translated into an OWL ontology, to annotate the tables. Our annotation system is unsupervised. It uses only the knowledge defined in the ontology to automatically annotate the entire content of tables, using an aggregation approach: it first annotates cells, then columns, then the relations between those columns. The annotations are fuzzy: instead of linking an element of the table to a single precise concept of the ontology, each element is annotated with several concepts, each associated with a relevance degree. Our annotation process has been validated experimentally on scientific domains (microbial risk in food, chemical risk in food) and a technical domain (aeronautics).
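The aggregation approach described in the abstract (cells, then columns, then relations, each with a relevance degree) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the toy ontology, the relation signature `GrowthIn`, the word-overlap (Jaccard) similarity, and the mean and min/max aggregation rules are not taken from the paper, which does not specify its measures in this abstract.

```python
from collections import defaultdict

# Hypothetical toy ontology: concept -> known terms (not from the paper).
ONTOLOGY = {
    "Food": ["minced beef", "smoked salmon", "whole milk"],
    "Microorganism": ["listeria monocytogenes", "salmonella enterica"],
}

# Hypothetical relation signatures: relation name -> concepts it links.
RELATIONS = {
    "GrowthIn": {"Microorganism", "Food"},
}


def similarity(a, b):
    """Jaccard overlap of lowercase word sets (an assumed, simplistic measure)."""
    ta, tb = set(a.lower().split()), set(b.lower().split())
    if not ta or not tb:
        return 0.0
    return len(ta & tb) / len(ta | tb)


def annotate_cell(cell):
    """Return {concept: degree}, keeping every concept with a nonzero degree."""
    scores = {}
    for concept, terms in ONTOLOGY.items():
        degree = max(similarity(cell, term) for term in terms)
        if degree > 0:
            scores[concept] = degree
    return scores


def annotate_column(cells):
    """Aggregate cell annotations: mean degree per concept over the column."""
    totals = defaultdict(float)
    for cell in cells:
        for concept, degree in annotate_cell(cell).items():
            totals[concept] += degree
    return {concept: total / len(cells) for concept, total in totals.items()}


def annotate_relations(columns):
    """Score each relation by its weakest concept, using the best column for each."""
    col_annots = [annotate_column(col) for col in columns]
    result = {}
    for name, signature in RELATIONS.items():
        degree = min(
            max(annot.get(concept, 0.0) for annot in col_annots)
            for concept in signature
        )
        if degree > 0:
            result[name] = degree
    return result


# A table given as a list of columns; each column is a list of cell strings.
table = [
    ["Listeria monocytogenes", "Salmonella"],
    ["smoked salmon", "beef"],
]

print(annotate_column(table[0]))
print(annotate_relations(table))
```

The point of the sketch is that no element is forced onto a single concept: a cell such as "Salmonella" keeps a partial degree for `Microorganism`, and that uncertainty propagates upward to the column and relation levels.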


Similar resources

Query Architecture Expansion in Web Using Fuzzy Multi Domain Ontology

Due to the growth of the web, there are many challenges in establishing a general framework for data mining and retrieving structured data from the Web. Creating an ontology is a step towards solving this problem. The ontology raises the main entity and the concept of any data in data mining. In this paper, we tried to propose a method for applying the "meaning" of the search system, but the problem ...


Flexible SPARQL Querying of Web Data Tables Driven by an Ontology

This paper concerns the design of a workflow for feeding and querying a data warehouse open on the Web, driven by a domain ontology. This data warehouse has been built to enrich local data sources and is composed of data tables extracted from Web documents. We recall the main steps of our semi-automatic method to annotate Web data tables driven by a domain ontology. The output of this ...


Fuzzy Semantic Approach for Data Integration Applied to Risk in Food: an Example about the Cold Chain

A preliminary step to risk in food assessment is to gather experimental data. During the Sym’Previus project, we have designed a system for the integration of experimental data on food microbiology. Data provided by industrial partners and data extracted from experimental research results published in the main scientific journals of the domain are stored into a database. For that, data are inde...


Efficient Semantic Web Data Querying And Integration Using Fuzzy Ontology

M.S. Raghuram (Research Scholar, M.Tech) and M. Thenmozhi (Assistant Professor), Dept of Information Technology, SRM University, Chennai, India. Abstr...


Semantic annotation of Web data applied to risk in food.

A preliminary step to risk in food assessment is the gathering of experimental data. In the framework of the Sym'Previus project (http://www.symprevius.org), a complete data integration system has been designed, grouping data provided by industrial partners and data extracted from papers published in the main scientific journals of the domain. Those data have been classified by means of a prede...




Publication date: 2009